Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck
نویسندگان
چکیده
As an emerging type of user-generated content, micro-video drastically enriches people's entertainment experiences and social interactions. However, the popularity pattern individual still remains elusive among researchers. One major challenges is that potential a tends to fluctuate under impact various external factors, which makes it full uncertainties. In addition, since micro-videos are mainly uploaded by individuals lack professional techniques, multiple types noise could exist obscure useful information. this paper, we propose multimodal variational encoder-decoder (MMVED) framework for prediction tasks. MMVED learns stochastic Gaussian embedding informative its level while preserves inherent uncertainties simultaneously. Moreover, through optimization deep information bottleneck lower-bound (IBLBO), learned hidden representation shown be maximally expressive about target compressive in features. Furthermore, Bayesian product-of-experts principle applied encoder, where decision keeping or discarding made comprehensively with all available modalities. Extensive experiments conducted on public dataset collect from Xigua demonstrate effectiveness proposed framework.
منابع مشابه
Deep Variational Information Bottleneck
We present a variational approximation to the information bottleneck of Tishby et al. (1999). This variational approach allows us to parameterize the information bottleneck model using a neural network and leverage the reparameterization trick for efficient training. We call this method “Deep Variational Information Bottleneck”, or Deep VIB. We show that models trained with the VIB objective ou...
متن کاملStochastic Variational Video Prediction
Predicting the future in real-world settings, particularly from raw sensory observations such as images, is exceptionally challenging. Real-world events can be stochastic and unpredictable, and the high dimensionality and complexity of natural images require the predictive model to build an intricate understanding of the natural world. Many existing methods tackle this problem by making simplif...
متن کاملRelevant sparse codes with variational information bottleneck
In many applications, it is desirable to extract only the relevant aspects of data. A principled way to do this is the information bottleneck (IB) method, where one seeks a code that maximizes information about a ‘relevance’ variable, Y , while constraining the information encoded about the original data, X . Unfortunately however, the IB method is computationally demanding when data are high-d...
متن کاملTowards Boosting Video Popularity via Tag Selection
Video content abounds on the Web. Although viewers may reach items via referrals, a large portion of the audience comes from keywordbased search. Consequently, the textual features of multimedia content (e.g., title, description, tags) will directly impact the view count of a particular item, and ultimately the advertisement-generated revenue. This study makes progress on the problem of automat...
متن کاملRecurrent Neural Networks for Online Video Popularity Prediction
In this paper, we address the problem of popularity prediction of online videos shared in social media. We prove that this challenging task can be approached using recently proposed deep neural network architectures. We cast the popularity prediction problem as a classification task and we aim to solve it using only visual cues extracted from videos. To that end, we propose a new method based o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Multimedia
سال: 2023
ISSN: ['1520-9210', '1941-0077']
DOI: https://doi.org/10.1109/tmm.2021.3120537